Association Analysis of Semi-structured Data for Discrimination Discovery in Business

Authors

  • Binh Luong Thanh
  • Franco Turini
Abstract

Data mining techniques play a critical role in numerous domains such as consumer analytics, finance, banking, medicine, biology, and astronomy. Recently, they have also been applied to discovering illegal discriminatory treatment on the basis of sensitive attributes such as race, color, religion, nationality, gender, and age. In this paper, we propose a framework for addressing discrimination in the context of semi-structured business data, in particular in the calculation of taxes for imported goods. The framework discovers possibly discriminatory relations in the data by finding discriminatory association rules with the support of a common-sense knowledge base and text mining techniques. The framework has been applied to the HTS (US Harmonized Tariff Schedule), with some satisfactory results.


Similar resources

Discriminatory Association Analysis on Semi-structured Data

Data mining has been applied to the discovery of illegally discriminatory treatments caused by protected-by-law attributes such as race, gender, age, etc. In this paper, we propose an improvement for the previous work of exploring discrimination in semi-structured business data. The main idea is that discrimination represented in the form of association rules is judged by opposite patterns whos...


Discovering Association Rules in Semi-structured Data Sets

The discovery of association rules is one of the classic problems of data mining. Typically, it is done over well-structured data, such as databases. In this paper, we present a method of discovery of association rules in semi-structured data, namely, in a set of conceptual graphs. The method is based on conceptual clustering of the data and constructing of a conceptual hierarchy. A feature of ...


Mining Association Rules from Semi-Structured Data

Despite the growing popularity of semi-structured data such as Web documents, most knowledge discovery research has focused on databases containing well structured data. In this paper, we try to find useful information from semistructured data. In our approach, we begin by representing semi-structured data in a prototype-based approach. We then detect the most typical common structure of semist...


Identifying Implementation Requirements of Massive Open Online Course in Payam Noor University from an Economic Perspective

The aim of the present research was to identify the implementation requirements of a Massive Open Online Course (MOOC) in Payam Noor University from an economic perspective. The methodology used in this study was applied and the method of data collection was qualitative. The components used were based on the documentation and semi-structured interview tools. Inductive content analysis was used in three l...


Designing the Business Model of the Sports Academies (Case Study: National Academy of Gymnastics)

The purpose of this study is to design a business model in sports academies from the perspective of experts of this field. The present study is based on the paradigm of interpretive research, in terms of practical purpose, qualitative research approach and data collection in the form of in-depth and semi-structured interviews. The statistical population of the study includes entrepreneurs, spor...




Publication date: 2010